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ABSTRACT 

Engineering design and optimization efforts using 
computational systems rapidly become resource inten- 
sive. The goal of the surrogate-based approach is to 
perform a complete optimization with limited resources. 
In this paper we present a Bayesian-validated approach 
that informs the designer as to how well the surrogate 
performs; in particular, our surrogate framework pro- 
vides precise (albeit probabilistic) bounds on the errors 
incurred in the surrogate-for-simulation substitution. 
The theory and algorithms of our computer-simulation 
surrogate framework are first described. The utility of 
the framework is then demonstrated through two illus- 
trative examples: maximization of the flowrate of fully 
developed flow in trapezoidal ducts; and design of an 
axisymmetric body that achieves a target Stokes drag. 


INTRODUCTION 

Computational design efforts typically consist of a se- 
quence of optimization problems characterized by dif- 
ferent combinations of design parameters and objective 
functions. Each design problem is solved by means of 
a nonlinear optimization algorithm. Numerical simu- 
lations can be incorporated into the optimization algo- 
rithm either by direct-insertion or through a surrogate- 
based approach. In direct-insertion, the large-scale sim- 
ulation is inserted into the optimization algorithm as a 
function call. In the surrogate-based approach, a sur- 
rogate (a simple model such as a response surface) for 
the simulation is first constructed, and the surrogate 
is then incorporated into the optimization algorithm in 
place of the full simulation. 


Many optimization problems based upon direct- 
insertion are hindered, first, by the cost of each inquiry 
to the simulation code, and second, by the algorith- 
mic difficulties of merging the large-scale simulation 
with the mathematical programming algorithm. Fur- 
thermore, the high cost of objective-function evalua- 
tions for the direct-insertion method reduces the inter- 
activity of the design process. Finally, the number of 
objective-function evaluations required to achieve the 
optimal solution is typically not known a priori, and 
computational resources may thus be exhausted before 
a feasible design is achieved. The latter is especially 
pronounced in the case of multidisciplinary optimiza- 
tion (Newman et al., 1992). 

To circumvent these difficulties, we pursue a 
surrogate-based approach to optimization (McKay et 
al., 1979; Sacks et al., 1989; Barthelemey and Haftka, 
1993; and Ye§ilyurt and Patera, 1995). The surrogate 
approach offers a number of advantages. First, by con- 
struction, surrogate evaluations require very little com- 
putational effort, and are thus easily incorporated into 
optimization procedures. Second, the very low compu- 
tational requirement permits a more complete optimiza- 
tion by assuring that computational resources will not 
be exhausted before the design is complete. Third, sur- 
rogates create highly interactive and flexible design en- 
vironments, allowing multiple optimization tasks char- 
acterized by different design parameters to be pursued 
more easily. 

As regards disadvantages, the most important draw- 
back is that surrogate-based optimization introduces 
a new source of error. A surrogate validation strat- 
egy and appropriate error norms must be developed to 
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quantify the discrepancy between the surrogate and the 
actual simulation, and to estimate the errors in system 
predictability and optimality. The former provides a 
bound on the behavior of the actual objective function 
in the neighborhood of the surrogate minimizer, while 
the latter provides a bounding region in which the ac- 
tual minimizer lies. 

In this paper, we first describe the optimization prob- 
lem which is central to the remainder of the paper. We 
then briefly review our baseline surrogate framework, 
and offer references to more detailed descriptions. Next, 
we present several improved surrogate algorithms that 
have been developed. In the final section of the paper, 
we apply the surrogate framework to two design op- 
timization problems: optimization for pressure-driven 
flow in a trapezoidal duct; and shape optimization for 
Stokes flow past an axisymmetric body. 

OPTIMIZATION 

The minimizer to the exact optimization problem is 
given by 

p* =argmin|5(p) - A| , (1) 

pen 

where p is the vector of M design inputs which lie in the 
input (or “design”) domain Q C 1R M , <S(p) is the input- 
output relation, and A is the target value. This problem 
is also referred to as a “discrimination” problem (Seber 
and Wild, 1989). More general objective functions are 
examined in Ye§ilyurt (1995), and Ye§ilvurt and Patera 
(1995). 

As already described, direct insertion of the full sim- 
ulation for <S(p) in equation (1) is impractical. Instead, 
a surrogate, <S(p), for the simulation is inserted into the 
optimization problem. The minimizer for the resulting 
surrogate-based optimization problem is given by 

p* =argmin|5(p) - A| . (2) 

pSQ 

The optimization then proceeds as for direct-insertion, 
but the surrogate is evoked instead of the simulation. 

The advantages of using a surrogate framework for 
optimization have already been described. The issues of 
predictability and optimality must be addressed (Bohlin, 
1991). For predictability , we are concerned with how 
the actual simulation behaves in the vicinity of the 
surrogate-predicted minimizer, p*. We seek to be able 
to bound |5(p ~ p*) — <S(p)|, and also to ensure that 
this bound is acceptably small. The proximity of p* 
to p*, the actual optimizer, (i.e. optimality ), is also 
important, although not as critical in this context as 
predictability. The validation strategy which yields pre- 
dictability statements is addressed next. 


SURROGATE FRAMEWORK 

The Bayesian-validated surrogate methodology pre- 
sented here has the distinguishing attribute of inte- 
grating a complete and rigorous validation framework. 
The outcome of the validation step is fully incorporated 
into the a posteriori purposive analysis, and hence the 
predictability of any optimization is a natural exten- 
sion of the validation. The validation itself is based 
on the results of order statistics (David, 1981) where 
the validation sample size (N va ) is determined a pri- 
ori. In this section, we briefly describe the baseline 
surrogate-validation algorithm and a posteriori error 
analysis framework. A full description, and additional 
algorithms, can be found in Paraschivoiu (1995) and 
Otto et al. (1995). 

A Baseline Approach 

We first define a model prediction error estimator: 

U = max | Rj — <S(p)| , (3) 

where Ap’ a is a random set of N va points in fl dis- 
tributed according to an importance function p( p). 
Then, given a vector of “uncertainty” parameters (0) 
we can calculate the required sample size. Two dif- 
ferent predictability statements have been developed: 
“Prediction Region” (VI Z) and “Proximal-Candidate” 
(VC). The former gives a predictability statement for 
a region neighboring the surrogate-based optimum, p*, 
while the latter evaluates the predictability of a specific 
nearby candidate design. 

The validation sample sizes for each approach are 

VIZ : N va = lne 2 / ln(l — £i) , (4) 

VC-'. £-2 = — — — fl - (1 -i 1 ) N ' ,a+1 ) , (5) 

rj(N va + 1) V " ) ’ ■ 

where 0 = (ei,e 2 ) £ (0, l) 2 in equation (4) and 0 = 
(rj,e 2 ) £ (0, l) 2 in equation (5). 

A Posteriori Error Analysis 

Two approaches to the error analysis of our surrogate- 
predicted optimum points are presented. The first is 
a “Prediction Region” approach and the second is a 
“Proximal-Candidate” based approach. Both of these 
analyses seek to address the predictability concerns al- 
ready described. 

Prediction Region. The inputs to the “Prediction 
Region” error analysis are the model prediction error, 
U, from the surrogate validation, and the surrogate- 
predicted optimizer p*. We introduce a parameter, ij , 
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£\ < // < 1. and construct a region, TZ such that 

[ P(p)dp = V ■ (6) 

Jn 

Here, TZ is typically chosen as that region, TZ' , of p- 
measure p which minimizes A max , the maximum dis- 
tance, A(p',p*), between all points p' G TV and p*. 
The distance A(-, •) can be any valid distance between 
p' and p*. 

Introducing a surrogate sensitivity parameter 

8 = max|5(p) - <S(p*)| , (7) 

p eiz 

the following statement can be made: There exists 

many points, p, in the region TZ such that, with prob- 
ability greater that 1 — £ 2 , 

|<S(p) - M < |<5(p*) - A| +w , (8) 

where w = U + <5 is the predictability- gap. This state- 
ment provides a probabilistic bound on the actual sys- 
tem performance, |<S(p) — A|, for points, p, near the 
surrogate-predicted optimum, p* . Recall that the num- 
ber of simulations to obtain U is given by Equation (4). 

Proximal-Candidate. In many problems it may be 
difficult, or even impossible, to explicitly construct pre- 
diction regions TZ C 12 as described in the last sec- 
tion. Another approach is to only observe the pre- 
diction region (and not construct the region) through 
sample candidate designs, P*, which, with a prescribed 
probability, will exhibit actual performance within the 
predictability gap of the surrogate-predicted optimal 
performance. We refer to this method as a “Proximal- 
Candidate” approach to a posteriori analysis (Otto, 
1995, and Otto et al., 1995). 

The two inputs to the validation are p and £ 2 , where 
0 < p,c 2 < 1. In the a posteriori analysis, we introduce 
that region TZ C ft which satisfies Equation (6) and 
minimizes A max , and sample a P* G TZ according to 
the probability density 

Pn( p) = -p( p)|re Vp G TZ . (9) 

V 

Using acceptance-rejection techniques, the region TZ 
can be measured and sampled without actually con- 
structing the region. 

The a posteriori error statement (similar to the “Pre- 
diction Region” result) is 

Pr{|S(P*) - A| < |5(p*) — A| T w'} > 1 - e 2 ,, (10) 

where w' = U + S' is the predictability gap and S' = 
|«S(P) — <S(p*)|. In words, Equation (10) says that, with 


probability greater than 1 — £ 2 , there will exist a can- 
didate design P* for which actual system performance, 
|<S(P*) — A|, is within uj' of the surrogate-predicted 
optimum objective value, |<S(p*) — A|. The number of 
validation points needed is given by Equation (5). 

Finally, we remark that the “uncertainty” vector 0 
measures the degree of precision with which the surro- 
gate must be validated. As £ 1 , £ 2 , and p decrease, the 
number of validation points increases, and more infor- 
mation as to the quality of the surrogate is extracted. 
The probabilistic framework results in a fixed validation 
sample size, allowing the designer to tune the “uncer- 
tainty” vector to the available resources. 


IMPROVED ALGORITHMS 

In this section we describe several improvements to the 
baseline algorithm that have been developed. A full 
description of the additional algorithms can be found 
in Paraschivoiu (1995). 


Elemental Algorithm 

One of the shortcomings of the baseline approach to 
validation is the global nature of the model prediction 
error U. That is, a large error at one point of the in- 
put space has a dominating influence over the entire 
input space. One way to address this drawback is to 
divide the input domain, 12, into L non-overlapping, 
elements (or sub-domains) (Paraschivoiu, 1995; Otto et 
al., 1995). Validation is performed independently on 
each sub-domain yielding a local model prediction er- 
ror, (U e , t = 1,. ... ,L), associated with each element, 
fl e . The U l for each element is then used in the a pos- 
teriori error analysis. 

For the elemental algorithm, the size of the validation 
sample for each element, N vat ’, is 


VIZ : 


VC. : 


KTva l. _ In (1 - (1 - £- 2 ) 1/L ) 
1 - ln(l — Lei) 


(11) 

( 12 ) 


(i-d-^r 


)' (I - V ""' 1 


The locality improvement in the model prediction er- 
ror estimator is obtained with a small increase in the 
number of validation points for small numbers of ele- 
ments. For a very large number of elements, even fewer 
validation points are needed than for a single element. 
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Cross-Validation, Re-Sampling 

A natural extension of the elemental algorithm for 
the VIZ approach is an intra-model, cross-validation 
procedure (Paraschivoiu, 1995) which allows the use of 
the available simulation data for both construction and 
validation. Indeed, due to the elemental partitioning, 
the validation data from one element can be used for 
the construction of a surrogate model in another el- 
ement. The only restriction is an ordered dependence 
requirement, which means that validation data can only 
be used for construction in higher numbered elements 
(with respect to some ordering). 


Model-Sequential Approach 

To achieve a desired target level for the model pre- 
diction error estimator, a model-sequential approach is 
pursued. We assume that we are given a sequence of J 
models, Sj{ p), j = 1 and a desired model pre- 
diction error, a > 0. The sequential algorithm validates 
each subsequent model with the same points as used 
for the previous model plus one new random validation 
point. If, for the j th model, the model prediction error 
estimator is less than or equal to a, then that model 
is selected. If, on the other hand, all the models have 
been tested, and none has satisfied the desired model 
prediction error, then the last model is selected. Re- 
spectively, a or U J then serves as the model prediction 
error estimator in the a posteriori analysis. The initial 
number of validation points is given by 

= >” ( , _(V- g| ).0 / ‘"d - ft) - 1 - (13) 

An Adaptive, Model-Sequential Approach 

To integrate construction into the model-sequential 
approach, an adaptive algorithm is considered. In this 
algorithm, simulation evaluations used for the valida- 
tion of previous models are incorporated into the sur- 
rogate construction, and a new set of validation points 
is chosen for each subsequent validation. The size of 
each sample set (assuming J finite) is 

N va = In (y) / ln(l -ej) . (14) 

The desired model prediction error, a, is used as pre- 
viously described in the model-sequential approach. In 
practice, this approach is very powerful for the con- 
struction of surrogates that achieve the desired model 
prediction error. However, the cost of evaluating a new 
set of validation points at each step is expensive. This 
drawback is currently under investigation. 


We conclude by commenting that the elemental par- 
titioning can be incorporated in both the sequential and 
the adaptive methods. This combination offers signif- 
icant advantages in obtaining the desired local model 
prediction error with a minimal number of validation 
points. Indeed, in some elements, the desired elemen- 
tal model prediction error can be reached faster than 
in others, and the iterative process stopped in those 
elements. 


APPLICATIONS 

Flow in a Trapezoidal Duct 

As our first illustrative example, we consider fully de- 
veloped laminar fluid flow in a duct of constant trape- 
zoidal cross-section. The optimization problem is to de- 
termine the geometry, for a given cross-sectional area 
A (l , that achieves a target flowrate. In this problem the 
pressure gradient is fixed. The trapezoidal geometry 
is defined by two inputs; the angle between the base 
and the side, and the height normalized with respect 
to the base length. The non-dimensional form (') of 
the Poiseuille flow equation inside the duct (V), with 
non-slip boundary conditions on the walls (B) is 

( v3 “ W 1 '°r' < 15 > 

[ u = 0 on B . 

Using non-dimensional variables we relate the nor- 
malized flowrate to the geometrical properties. We first 
write the pressure drop (A P) as a function of the fric- 
tion factor. Knowing that, for laminar flow, the fric- 
tion factor is simply a shape-dependent constant g over 
the Reynolds number, we can obtain an expression for 
the dimensional flowrate; here the Reynolds number is 
taken with respect to the hydraulic diameter of the di- 
mensionless trapezoidal section, D/,. The normalized 
flowrate can be written 


Q 


Q 


2AP A 2 

~ A o 


£l = ± 

gA 2 A ’ 


(16) 


where L is the length of the duct, p is the dynamic vis- 
cosity of the fluid, and u is the non-dimensional cross- 
sectional average velocity. 

Equation (16) is useful in incorporating prior infor- 
mation on particular geometries through different val- 
ues of g. In addition, we see that our optimization 
problem is simplified by the absence of scale depen- 
dency, that is, our solution is independent of the cross- 
sectional area, A 0 , of the dimensional duct. The opti- 
mization problem is a simple inverse-simulation type: 
find the input geometry (h*,0*) that minimizes the dif- 
ference between the target A and the numerical solution 
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of the normalized flowrate . That is, 


0.014 


(/&*,#*) = arg min \S(h,0) — A| (17) 

(h,6)en 

i x Qdesired i <-./£ ^ 

where A = 0 A p and o(h.tl) = — - . 

ff A 0 2A 

To obtain the velocity field in the trapezoidal chan- 
nel, a numerical approximation scheme is used to solve 
Equation (15). In particular, we employ a second or- 
der finite element method on triangular elements in M 2 
with automatic mesh generation and conjugate gradi- 
ent iterative solution. For all geometries, approximately 
4500 degrees-of-freedom are used, and each evaluation 
of the flowrate takes approximately 10 seconds on a 
single processor of a HP9000/735 workstation. The nu- 
merical results can be considered to have a normalized 
£oo error norm less than 5 x 10 — 4 , which motivates us to 
consider these results to be without “noise” (Ye§ilyurt 
et ah, 1995). 

Surrogate approach. The design (or input) space 
covers different trapezoidal geometries ranging from a 
rectangle to an isosceles triangle. Small clearance limits 
are included to yield the following input space: 

ft = j(h,#)|0.01 <h< min(tan#,4.0); 0.05 <9< ^ j . 

(18) 

In all cases the importance function is taken to be uni- 
form. We first apply the baseline algorithm to find the 
trapezoidal geometry that achieves a target normalized 
flowrate. In this first optimization exercise, we con- 
sider only variation of the height of the section, while 
fixing the angle of the trapezoid to f ; the corresponding 
“slice” of f l, [0.0, 0.5], is denoted fi'. 

The four steps of the surrogate-based optimization 
are as follows. First, we construct a surrogate based on 
prior information only; in particular, we take g = 53.33 
(the known value for an equilateral triangle) in Equa- 
tion (16). In Figure 1, the normalized flowrate is plot- 
ted versus the dimensionless height, and the surrogate 
is shown by the solid line. Second, we use this surro- 
gate to find the height (p*) corresponding to the de- 
sired target flowrate, A = 0.01. Third, we wish to 
know how well our surrogate has performed. To ob- 
tain this information, validation is carried out. The 
number of validation points, that is the number of cal- 
culations of the flowrate for random height values, is 
obtained from Equation (4). For s 1 = e 2 = 0.2, eight 
validation points are needed. These points are repre- 
sented by circles on Figure 1. The maximum differ- 
ence between the surrogate and the exact simulation 
value of the normalized flowrate over all the validation 
points determines the model prediction error estimator, 



Figure 1: Surrogate for duct flow example using the 
“baseline” algorithm. 


U. Fourth, and finally, we can make the following pre- 
dictability statements: With probability > 0.8 (1 —£ 2 ); 
(a) the surrogate error is less than U = 15.8% over 
at least 80% (1 — £ 1 ) of the input domain O', and, (b) 
there exists many points in the prediction neighborhood 
of the surrogate optimum geometry p*, where the 
exact simulation is within 34.6% of the target flowrate, 
normalized by the maximum value of the surrogate. 

The predictability statement indicates (a) quantita- 
tively how good the surrogate is over the entire domain, 
and (b) how well it performs in the neighborhood of the 
actual optimum. The neighborhood is indicated on the 
figure by V\. This example is useful to demonstrate 
not how well we can construct a surrogate, but rather, 
what can be said about the surrogate once it is con- 
structed. In this case, we see that our model does not 
perform very well. 

One way to decrease the error estimate, for a given 
construction, is to decompose the input space into ele- 
ments and validate the surrogate in each element. We 
divide the input space into eight different elements 
(shown in Figure 2). In addition, we use the cross- 
validation algorithm to simultaneously construct and 
validate as follows: First, element one is constructed 
using prior information (no flow at h = 0) and only one 
construction point. Second, this surrogate is validated 
with three validation points. Third, the points used to 
validate the surrogate in the current element are used 
to construct the surrogate in the next element with a 
quadratic extrapolation. The second and third steps 
are repeated until surrogates are constructed and vali- 
dated in all of the elements. This procedure results in 
a surrogate with a local U of less than 0.8% in each ele- 
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Figure 2: Surrogate for duct flow example using the 
“Elemental” algorithm. 

ment. This surrogate (solid line in Figure 2) can now be 
used similarly to the baseline algorithm, and surrogate- 
based optimization can be pursued. The predictability 
statement is improved due to the use of the local model 
prediction error. 

Finally, the adaptive algorithm is applied to demon- 
strate the surrogate framework for a two-dimensional 
input space. The “uncertainty” vector, 0, is set to 
0 = ( i , So) = (0.05,0.10). We limit ourselves to 
at most five models ( J = 5) constructed using Shep- 
ard’s algorithm (Shepard, 1968), with which we wish 
to achieve a model prediction error of less than 5% 
(i a = 8.8 x 10 -4 ). From Equation (14), the number 
of validation points needed for each validation step is 
77. The target error was achieved with the second sur- 
rogate and required 127 simulations for construction. 
The first model had 50 construction points and per- 
formed with a 7.8% model prediction error estimate. 
Therefore, a second surrogate was needed. For this sur- 
rogate, the 77 points used to validate the first surro- 
gate were used, along with the original 50 construction 
points, to achieve the target error. 

The surrogate-based maximization of the flowrate 

yields h = 0.99, 6* = 1.55, S(h J*) = 1.76 x 1CT 2 . 
Figure 3 shows isocontours of the normalized flowrate 
given by the surrogate model, the surrogate-predicted 
optimum geometry, and the prediction neighborhood. 
Based on the validation step and the a posteriori er- 
ror analysis, the predictability statements read, with 
probability >0.9: (a) the surrogate error is less than 
a = 5% over at least 95% of the input domain f l(h,9), 
and, (b) there exists many points in the prediction 
neighborhood of the surrogate optimum geometry p*, 



h 

Figure 3: Surrogate for duct flow example using the 
“adaptive” algorithm. 

V\, where the exact simulation is within 8.9% of the 
target flowrate. 

The exact solution to this optimization problem is a 
square cross-section geometry. We can compare this 
with our surrogate-based optimization on Figure 3 and 
observe that they are very close. In addition, the dif- 
ference between the maximum normalized flowrate pre- 
dicted by the surrogate and predicted by the simulation 
is less than 0.1%. This is well within the model predic- 
tion error estimate. For this optimization problem, we 
see that sharp results can be obtained at the expense of 
increased cost. In practice, this cost could be decreased 
by using combinations of the previous algorithms. 

We conclude by acknowledging that this test case falls 
short of completely illustrating the advantages of the 
surrogate framework. For example, due to the relatively 
inexpensive computations, and because the objective 
function is smooth, a direct insertion approach could 
be used. However, for multiple design optimizations, 
direct insertion, even in this case, can be prohibitively 
expensive compared to the surrogate framework. Fur- 
thermore, in practice, most engineering optimization 
studies require expensive simulations (Ye§ilyurt et al., 
1995; Ye§ilyurt and Patera, 1995). 

Shape Optimization 

Many engineering design problems involve not only 
configuration optimization, in which optimal values of 
a small number of parameters (e.g. radius, position, 
angle, etc.) are desired, but also shape optimization, in 
which the optimal form of a curve is desired. Given the 
deterioration of volume-based error estimates present 
for problems with high input dimensions, it would ap- 
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pear that shape optimization problems (described ex- 
actly by an infinite - dimensional input space) would be 
intractable in this framework. This is not the case, how- 
ever, primarily due to two reasons. First, highly irreg- 
ular shapes will typically be non-optimal, allowing for 
truncation of the input space. Second, the remaining 
inputs are typically highly correlated, further reducing 
the effective volume of the input space. 

To demonstrate the surrogate based shape optimiza- 
tion procedure, we examine the Stokes drag over an 
axisymmetric, fore-aft symmetric body. The simula- 
tion for which the surrogates are substituted is a finite 
element solver utilizing an Uzawa nested conjugate gra- 
dient iteration. The inputs to the problem are a set of 
coefficients which describe the shape, b(6), and the out- 
put is the non-dimensional drag, Fd- The design op- 
timization problem is to find the optimal shape, b*(9), 
for a given target drag, A; 


b*{9) = arg min I Fjj — A| . (19) 

b(6)eW + ,b(0)<C(6} 


The general geometry and the physical constraint on 
the optimal profile, C(9 ), are shown in Figure 4. 

The surrogate for the Stokes drag over the body is 
simply the Stokes drag over a sphere of equivalent sur- 
face area. Other surrogates such as response surfaces 
constructed from simulation data could be considered 
instead. The surrogate-based optimization problem is 
identical to Equation (19), but the simulation result, 
/•'/), is replaced by the drag surrogate, F/j. 

The quarter-body profile of the shape in the first 
quadrant describes the full three-dimensional body. 
The shape is expressed as a positive function b(6) G 
W+([0,7r]), where IV + is a set of admissible functions. 
We introduce a basis for W + ([0,7r]) and truncate this 
basis to M terms. The shape function can then be ex- 
pressed as 



Figure 4: Surrogate predicted optimal and proximal 
candidate shapes. 

statistical quantities of the ensemble, such as the mean 
and covariance. Through the mean and covariance of 
the radius-function describing the shape, the designer 
reflects his prejudices as to the class of shapes that will 
contain the optimizer. 

We select e 2 , and A(-,-) and proceed with the 
validation, the surrogate-based optimization, and the 
candidate-based a posteriori error analysis. For this 
problem we choose ?? = .18, and e 2 = -18. The distance 
criterion chosen, A(-, •), is the Hausdorff distance. With 
Equation (5), and our values of p and e 2 , we find that 
30 validation points are needed. The resulting model 
prediction error is U = .15. 

For the design problem, we choose a target drag A = 
304 dynes, and find the surrogate-predicted optimizer, 
b*(9). Based on = .18, we find A max = .35, using 
a “best scaled” geometry for the acceptance/rejection. 
The region in the proximity of the b*{9) is then queried 
for a candidate design B*(9). With this candidate, the 
following predictability statement can be made: With 
confidence greater than .82, 

\F d (B)-X\ < _ 0 .| + p v = 18 ( 21 ) 

A 


M 

bW = 1 + E Aj cos('2j9) . (20) 

j = 1 

The result is that the shape profiles are parameterized 
by A M = {A! , - ■ . , Ajy/}, which acts as the input, p. 
The shapes in this problem are described by 10 inputs 
(M = 10). 

To generate a random ensemble of shapes for valida- 
tion, and for Monte Carlo sampling for the proximal- 
candidate error analysis, a random-shape process in- 
ducing an importance function on the set of coefficients 
is employed (Elishakoff, 1983; Otto, 1995; Otto et ah, 
1995). With this method, certain smoothness condi- 
tions at the endpoints of the shape are enforced, as are 


In words, with confidence greater than .82, the abso- 
lute difference between the actual drag on the candi- 
date design and the target drag, A = 304 dynes, nor- 
malized by the target drag, will be less than or equal 
to .18. The coefficient, /?*, present in the final pre- 
dictability statement results from the scale-invariance 
(self-similarity) in the linear Stokes drag relation. This 
coefficient is near one and has the effect of increas- 
ing the model prediction error, U, from .15 to the 
value .18 present in Equation (21). Three-dimensional 
plots of the surrogate-predicted optimal design and the 
proximal candidate are shown in Figure 5. 

A designer can conclude from this result that there 
is likely to be a design near the surrogate-predicted 
optimal design which will satisfy the target drag re- 
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b*(Q) F D {b*) = A = 304 B*(6) F D (B*) = 312 

Figure 5: The surrogate-predicted optimal shape (left) and the proximal-candidate design (right). The target drag, A, 
and, the simulation-predicted Stokes drag of the proximal-candidate are indicated. 


quirement to within 18%. If this is deemed satisfac- 
tory, the designer can consider the process a success. 
The advantage of this approach is that, at this point, 
the designer can explore other objective functions us- 
ing the same validated surrogate, without the need for 
additional simulation evaluations. 

If the error reflected on the right side of Equation (21) 
is deemed too large, it can be reduced by an improved 
surrogate, which could be accomplished, say, by ded- 
icating a portion of the simulation evaluations to the 
construction of a response surface. A response surface 
would likely introduce a finite S' in (10) (not present 
for the simple surrogate presented here). To reduce S ' , 
a smaller prediction region 1Z (physically) is necessary, 
which requires a smaller e± and, therefore, either more 
validation points or a better p( p) (one that more accu- 
rately anticipates the “location” of the optimizer) . 
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